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NON-TRIVIAL MATRIX ACTIONS PRESERVE NORMALITY FOR 

CONTINUED FRACTIONS 

JOSEPH VANDEHEY 


Abstract. A seminal result due to Wall states that if x is normal to a given base b then 
so is r* + s for any rational numbers r, s with r 7 ^ 0. We show that a stronger result is 
true for normality with respect to the continued fraction expansion. In particular, suppose 
a,b, c, d £ Z with ad —be 7 ^ 0. Then if x is continued fraction normal, so is [ax-\-b)/[cx + d). 


1. Introduction 

A number x G [0,1) with base 10 expansion x = 0.010203 ... is said to be normal (to 
base 10 ) if for any finite string s = [ci, C 2 ,..., c^] we have that 

1 . #{0 < i < n : Oj+i = c,-, 1 < ?’ < A:} 1 

hm - - - - -= —r. 

n-^-cxD n lO*' 

Although almost all real numbers are normaQ, we still do not know of a single commonly 
used mathematical constant, such as tt, e, or ^/2, that is normal. 

In his Ph.D. thesis under D.H. Lehmer, Donald Dines Wall [22] proved a series of results 
on normal numbers which are now considered classical, elementary facts. Among them. 
Wall proved that if x is normal then qx + r is normal for any rational numbers q, r with 
g / 0. Chang [7| appears to have discovered this result independently, while Doty, Lutz, 
and Nandakumar [ 8 | knew of Wall’s result and reproved it by a different method. Aistleitner 
[2| has given the only significant extension of Wall’s result the author is aware of, showing 
that if X is normal and y G M is a number with almost all of its digits equal to 0, then 
X + ry is normal to base 10 for any rational r. (See also jO] pp. 97].) 

Although the definition of normality is easily extended to many other digital systems, 
questions about which operations preserve normality are still unanswered in most cases. 
Recently, the author, with Airey and Mance, [1] studied how rational multiplication and 
addition act for Q-Cantor series expansions. 

However, in this paper, we shall be interested in normality for continued fraction expan¬ 
sions, which we shall abbreviate as CF-normality and define explicitly in a moment. Mendes 
France first asked the question of which operations preserve CF-normality |15l pp. 17-18]. 
He actually asked a simpler question, namely if non-zero rational multiplication preserves 
simple normalitjH for continued fractions. Yann Bugeaud extended the question to ask if 
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follow if we replaced 10 by any integer base b > 2. 
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non-zero rational multiplication preserved CF-normality [ 6 l Problem 10.56, pp. 222], Part 
of the difficulty of proving such a result comes from the fact that rational multiplication 
and addition are operations that are not very well understood for continued fractions. Re¬ 
search on these topics appears to have come almost completely from a computational side 
(“Given a continued fraction expansion x, how can we quickly compute the continued frac¬ 
tion expansion of qx -|- r?”). Notable works include Gosper Raney [18], and Liardet 
and Stambul m- On the theoretical side (“If x has a continued fraction expansion with 
property Y, does qx + r have property Z7”), the author is unaware of any significant result. 

We recall some standard definitions for continued fractions. The (regular) continued 
fraction expansion of a number x G M is given by 


1 

X = ao H- - -, with ao £ Z and a* G N for z G N. 

ai H-;- 

a2 + ... 

We will denote this expansion by (ao; ai, 02, ...) for typographical simplicity. This expan¬ 
sion is infinite if and only if x is irrational. We will refer to the nth digit of the continued 
fraction expansion of x by a„,(x) or just a„ if the choice of x is clear. The Gauss map 
T : [ 0 , 1 ) [ 0 , 1 ) given by 

2 ^ 3 . = / 5 - LiJ ’ ^ ^ 0 ’ 

|0, X = 0, 

acts as a forward shift on the continued fraction digits, ignoring oq. The Gauss measure /x 
given by 

'■<•"> = /. (log 2 K 1 

is a probability measure, preserved by T, and is ergodic with respect to T. Given a string 
s = [ci, C2, ..., Cfc], we define the cylinder set of s to be 

Cs = {x G [0, 1) : aj(x) = Cj, 1 < z < k}, 


and we say this cylinder set has rank k. We shall also need the usual matrix action on real 
numbers given by 

/a /3 \ _ ax + P 
y 7 ^ ) 'yx + 5 

With these definitions in mind, we say that a point x G [0,1) is GF-normal if for any 
string s, we have 


( 1 ) 


#{0 < i < n : r*x G Cg} 

hm - 

n—>-oo Tl 


KCs). 


Since T*x G Cg if and only if the string s appears in the continued fraction expansion of x 
starting at the z -|- 1 st position, the left-hand side of ([T]) represents the limiting frequency 
with which s appears in the continued fraction expansion of x. By the pointwise ergodic 
theorem, almost all numbers x G [0,1) are GF-normal. We extend the definition of GF- 
normal to all X G M to say x is GF-normal if x — ao(x) is GF-normal. 

Our main result will be the following, which not only answers Bugeaud’s question in 
the affirmative, but shows that any non-trivial linear fractional transformation preserves 
GF-normality. 
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Theorem 1.1. Let M be a 2 x 2 matrix with coefficients in Z and non-zero determinant. 
Let X G M 6e CF-normal. Then Mx is also CF-normal. 


For a very specific class of matrices M, the above theorem is trivial. In particular, if 
X = (0; oi, 02 , ...) is CF-normal, cq G Z, and ci, C 2 , ..., G N, then 


/ 1 Co \ / 0 1 W 0 1 

VO I J \ l Cl J \1 C2 


0 1 
1 Cfc 


X = (co; Cl, C 2 , . . . , Cfc, Oi, 02 , ...). 


It is easy to see that this preserves normality. However, all the matrices on the left here have 
determinant ±1, and thus so does their product. In fact, for any matrix with determinant 
±1, the action of the matrix on an irrational number x will alter the head of the expansion 
and leave the tail unchanged (see O Theorem 2.37]), thus preserving CF-normality. 

We emphasize that Theorem 11.11 works for any matrix that does not have determinant 
zero, not just those with determinant ±1. The reason why we exclude matrices with 
determinant zero is that in such a case Mx will always be the same rational number, 
regardless of which x is chosen. 

Due to earlier work of Kraaikamp and Nakada m and the author [2T] , we know that nor¬ 
mality for regular continued fraction expansions—the kind we are studying in this paper—is 
equivalent to normality for nearest-integer continued fractions and continued fractions with 
odd partial quotients. Thus Theorem 11.11 holds for these expansions as well. 


1.1. The idea and outline of the proof. Let us return to the question of normality to 
base 10 and give a glimpse into why Wall’s result is true. 

Given a number x that is base 10 normal, how often do we expect to see the digit string 
7 appear in the base 10 expansion of 2x7 We should see a 7 appear in the nth position 
of 2x whenever we see one of the strings 35, 36, 37, 38, 39, 85, 86, 87, 88, or 89 appear 
starting in the nth position of x. We call these strings trigger strings for the string 7. But 
since x is normal, each of these strings appears with limiting frequency 1/100 and there 
are 10 of them, so we expect to see 7 appear with limiting frequency 1/10. In this case, 
understanding how often trigger strings occur relies on knowing how the sequence (10"'x) 
is distributed modulo 1. 

A slightly harder problem: if x is base 10 normal, then how often do we expect to see 7 
in the base 10 expansion of x/3? Here, to determine what the nth digit of x/3 is, we must 
not only know what the nth digit of x is, we must know something about all the first n — 1 
digits. In particular, the nth digit of x/3 is 7 if the nth digit of x is 1, 2, or 3, and the sum 
of the first n — 1 digits of x is 2 modulo 3. If one could show that each of these options 
appeared with limiting frequency l/3oll that would give the desired limiting frequency for 
the string 7 in the base 10 expansion of x/3. 

This suggests that to show that division by 3 preserves normality to base 10, we want to 
understand how the pairs 

(2) I 10"'x (mod 1), 


^This is possible using the techniques of |21| . 


n—1 


^^Oi(x) (mod 3) 


2 = 1 
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distribute in the set [0,1) x {0,1, 2}. (Here aj(x) is referring to the zth digit of the base 10 
expansion, not the continued fraction expansion, of x.) 

We will run into a similar difficulty with continued fractions. What we will want is some 
way to examine the tail of the expansion of Mx, and to do this, we want to be able to push 
the matrix M through the first n digits of the continued fraction expansion for x like so: 


( 3 ) 


M 


fli + 


02 H-h 


= bo + 


bi + 


an + T^xJ 


62 + • • • + 


bm + Mn (T^X) 


with Mn belonging to some set Ai. Our analogy to ([2]) will be the sequence (T^x, M„) and 
we want to show this distributes nicely in the space [0, 1) x Ai. 


As an explicit example, suppose M 


1 0 
0 2 


. Then 


Mx = -X = - 

2 2 oi + Tx 


1 

2ai + 2{Tx) 


By reinterpreting this in terms of matrices, we get that 


1 

0 


0 

2 


0 1 
1 Ol 


0 1 \ / 2 0 \ 

1 2ai y 0 1 y ■ 


If instead, M 
ai(x): 


2 0 
0 1 


then there 


are more possibilities, depending on the value of 


f 2 0 \ f 0 1 

V 0 1 y V 1 2n 

/ 2 0 \ / 0 1 
VO lyVi 1 
2 0 \ / 0 1 
0 1 y V 1 2n + 1 


0 1 
1 n 

0 2 
1 1 

0 1 
1 n 



0 

2 


, and 



The statement of Lemma [Q will essentially say that if we started with a “nice” matrix 
M, then we can in fact choose the matrices in (l3|) to always belong to a particular hnite 
set A4; and in Remark 12.41 we show that it suffices to prove Theorem 11.11 when M G Al. 
Theorem 13 .1 1 then says that the sequence (T"'x, M„) is well-distributed with respect to some 
measure, provided M starts in a subset of Ai with good properties. In Section 01 we again 
show that it suffices to prove Theorem 11.11 when M is in such a subset. Finally, in Section 
[5l we will define exactly what it means to be a trigger string, and from here complete the 
proof of Theorem 11.11 in Section [6l 
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1.2. Notations and definitions. We will denote the continued fraction expansion of x 
by (ao; ai, 02 ,...) and the continued fraction expansion of Mx by (bo; bi, b2, ■ ■ ■ }■ We will 
commonly use n denote an index for x (so we might talk about the nth digit of x) while 
we will use m to denote an index for Mx. 

We will often refer to an arbitrary string as s and its digits as [co;ci,C2 ,... ,Cn] or (if 
Co = 0) as [ci, C2 ,..., Cn] (in this latter case, we say that s is proper). Throughout the 
paper, we will always consider cq > — 1 and c* G N for i G N. The length of a string 
s = [co; Cl,..., Cn], denoted by |s|, is n, regardless of the value of cq. The string [c] with no 
semi-colon will always denote the string [0; c]. When we wish to distinguish between strings 
of digits in x and strings of digits in Mx, we will use s = [cq; ci, C2 ,..., c^] for strings in 
X and t = [do; di, <^2 ,..., dm] for strings in Mx. We will also make reference to a string r 
that is a substring in the expansion oi Mx. 

We will use M to denote an arbitrary matrix and Ai to denote a collection of matrices. 
We will let 




1 i 
0 1 


and J = 


0 1 
1 0 


Typically the matrices Ai are denoted by Tj or T*, but we wish to preserve the letter T for 
the Gauss map. 

To any string s = [cq; ci, C2, ..., c^] there is a canonical identification with a matrix 


Ms = AcoJAc.JAc^... JAc„ 


P P' \ 

Q Q' J 


where P/Q = (cq; ci,..., c^-i) and P'jQ' = (cq; ci, C2 ,..., c„), both in lowest terms with 
positive denominator [3l Lemma 2.8]. With this in mind, we may define the concatenation 
of strings s and s' = [cq; c'^,..., c'^,], denoted by ss' or s.s', as the string associated to the 
matrix 

AdgMs' = AcqJA cj^JA c2 ... JAc^A^MA^MA fj^ ... JA^/ ^ 

reduced according to the following rules: if c^ + Cq > 0 then Ac^^A^i^ = ^c„+cf, otherwise 
if Cn + Cq = 0 then Ac„_.^JA^^A^MA^.!^ = ^c„_i+c'- We note that the concatenation 
[—1; 1].[—1; 1] equals the empty string. 

Similarly we may also define the action of a string on a point by 


(4) s.x = MsX = (co;ci,C 2 ,...,Cn + x). 

If X G [0,1) then this amounts to appending the string s to the start of the continued fraction 
expansion of x. It is clear by (jlj) that if x G [0,1) is CF-normal then s.x is CF-normal for 
any string s. 

We shall make use of standard asymptotic notation as well. We will say that /(n) = 
0{g{n)) if there exists a constant C (called an implicit constant) such that |/(n)| < C ■g{n). 
We will say that f{n) x g{n) (with implicit constant C) if /(n) = 0(g{n)) and g(n) = 
0(/(n)) (both with implicit constant C). If we have two k x k matrices Ki,K 2 , then we 
say that Ki x K 2 (with implicit constant C) if (iLi)jj (K 2 )ij for 1 < i,j < k (uniformly 
with implicit constant C). We will say /(n) = o{g(n)) if f(n)/g(n) ^ 0 as n —>■ 00. If 
a variable appears in a subscript of a big-Oh or little-oh, this denotes that the implicit 
constant or rate of decay is dependent on this variable. 
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We will say a vector or matrix is non-negative (or positive) if all its coordinates are 
non-negative (or positive). We will call a vector a probability vector if it is non-negative 
and the sum of its coordinates is 1. 


2. Matrices and resultant 

Given L) € N, let Wf/j denote the set of 2 x 2 matrices 

and satisfy one of the following conditions: 

• 7 = 0, /3 > 0, a, 5 > 0, and (5 < 5. (Type I) 

• (5 = 0, a > 0, /3 ,7 > 0, and a < 7 . (Type II) 

• a = 0, J > 0, /3 ,7 > 0 and 5 < {3. (Type III) 

• /3 = 0, 7 > 0, a, 5 > 0, and 7 < a. (Type IV) 

• a < 0, /3, 7 , J > 0, and |a| < 7 . (Type V) 

• /3 < 0, a, 7 , J > 0, and j/3| < 5. (Type VI) 

It is easy to see from the above conditions paired with determinant requirement that 
|a|, |/3|, I 7 I, |(^| < |D|, and thus Md is a finite set. 

Lemma 2.1. Given any M & M.d and j € N, there exists a matrix M' G A4 d, and integer 
m > 0 and integers do > —1 and di,d 2 , ■ ■ ■ ,dm such that 

( 5 ) MJAj = Ad,JAd,JAd^ ... JAd^M'. 

Moreover, if do = — 1 then m > 1. 


STRINGS 

a (3 
7 6 


that have determinant 


This lemma is similar in statement and purpose to Lemma 6 from Liardet and Stambul 

M- 

We note that we could have that m = 0 and do = 0 so that MJAj = M'. 


Proof. We let M 


" ^ Y Then 
7 <5 y 


MJAj 


(3 a + (3j \ 

S -f + SjJ 


We call this new matrix M_i and denote its coefficients by a_i, /3_i, 7_i, d_i as appropriate. 
First, we claim that we can choose do ^ and create a new matrix 


mo = a^;m_i = 


Q !_1 - do7-i /3-1 - dod_i \ 

7-1 <^-i J ’ 


with coordinates given by ao, /3oMo^ such that either Mo G M.d (in which case the proof 
is complete with Mq = M') or at least one of the inequalities |q;o| < 7 o or |/3o| < do holds 
and at most one of ao or j3o is non-positive. (Since the determinant is non-zero, we cannot 
have both ao and /3o be 0.) We proceed by cases. 

Case 1: M is a Type I or Type IV matrix. In this case, all coefficients of M_i are 
positive, except perhaps for a_i which could equal 0 if we had that f3 = 0. Regardless, we 
may choose 


a_i 

.7-1 


d-i 


( 6 ) 


do = min 
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which will guarantee that either 0 < a_i — do7-i < 7-i and 0 < /?_i — do^-i (if the first 
term is minimal), or else 0 < /3_i — do6-i < and 0 < a_i — do7-i (if the second term 
is minimal). 

Case 2: M is a Type II or Type III matrix. In this case, all coefficients of M_i are 
positive, except perhaps 7_i, which could equal 0 if we had that <5 = 0. If 7_i = 0, then 
we choose do = so that 0 < /3_i — d^S-i < (5_i. (This will be a Type I Matrix, 

finishing the proof in this case.) Otherwise, if 7_i ^ 0, then we choose do as in ([6]) to the 
same results. 

Case 3: M is a Type V matrix. In this case, all coefficients of M_i are positive, except 
for (3-1 which could be 0 or even negative. If /3_i > 0, then all coefficients are positive, 
and we again choose do as in ([6]) to the same results. If /3_i = 0, then we can let do = 0 
as 1/3-1 — dod_i| < <5-1 is trivially true in this case and a_i — do7-i will be positive. If 
j3-i < 0, then |/3_i| < |a| < 7 < <5_i, so M_i is already a Type VI matrix and we are done 
in this case. 

Case 4'- M is a Type VI matrix. In this case, 7_i,<5_i > 0, a_i < 0 and /3_i could be 
zero, positive, or negative. If /3_i is positive, then since \j3\ < 6, we have that |q;-i| < 7-1, 
we have that M_i is a Type V matrix and we are done in this case. So suppose /3_i < 0. 
Then since |/3| < d by assumption, we have that |/3-i| = |a + /3j| < |/3j| < \6j\ < d_i. Thus 
we have a_i < 0, /3_i < 0 and |a-i| < 7-1 and |/3-i| < d_i. So if we choose do = —1, we 
have 

a_i + 7-1 /3-i + d_i 

7-1 


Mo = 


with all the coefficients positive and 0 < ao < 70) satisfying the desired conditions. (Note 
that in this case, we do not have that Mq ^ which, with the work below, will guarantee 
the final line of the statement of the lemma.) 

Now we may assume that at least one of the inequalities |ao| < 7o or |/3o| < do holds and 
at most one of ao or /do is non-positive. 

If Mq € Ml d, then we are done. Note that if Mq has either ao or (Iq be negative, then it 
automatically must be a Type V or Type VI matrix. Therefore, let us suppose that Mq is 
not in Aif), so that all coordinates will be non-negative, with at most one 0 at ao or /do. 
Then we choose 


di = min 


Qq 

.70. 


.<^0. 


so that the matrix Mi = A'^MMq with coordinates ai,/3i,7i,di satisfies |ai| < 71 or 

l, 011 < di with at most one of ai or /3i is non-positive. If Mi is not in Ain, then we may 
choose d2 > 1 in the same way that we chose di and construct the matrix M 2 = A-,^JMi 
satisfying an analogous property, and so on. 

At some point as we iterate this procedure we must reach a matrix Mm & A4d- at every 
stage, as we go from the matrix Mi to the matrix Mj+i, we rearrange the coordinates and 
subtract at least 1 from one of them (no coefficients grow in size), and we know that the 
moment we hit a negative coefficient we will have a Type V or Type VI matrix, which is 

m. AId- Thus after some finite number of steps we must arrive at a matrix Mm G Aio- 

This completes the proof. □ 
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Lemma 2.2. In Lemma, \2.1\. the integer m is uniformly bounded by a constant dependent 
on D. 


Proof. Let ([5]) holds for some matrices M, M' and integers j,do,di,... ,dm. Let the coor¬ 
dinates of M and M' be a,l3,'y,5 and respectively. 

Let P/Q = {do;di,d 2 ,...,dm-i) and P'jQ' = {dQ]di,d 2 ,...,dm)- We may assume, 
without loss of generality, that m > 2, so that both Q and Q' are positive. 

By ([5]), we have for any x G M with x ^ —(7 -|- j(5)/(5 that 

( P P' \ 

MJAjX = Ad^JAd^ ... JAd^M'x = ( g qi j M'x. 


Allowing x to tend to infinity implies that 

/3 f P P' \ a' Pa' + P'i 

'd~\Q Qa' + QY’ 


where if any denominator equals 0 , we take the corresponding fraction to be equal to infinity. 
We prove the lemma by cases. 

Case 1: <5 = 0, a' = 0. In this case, we cannot also have that 7 q = 0 as M' has a non-zero 
determinant, so we must have that Q' = 0. However, we already know that Q' is positive, 
so this case does not occur. 

Case 2: 5 = 0 , a' / 0 . In this case, we must have that QjQ' = —^'ja'. But it is well- 
known that Q/Q' = (0; dm, dm-i Ym- 2 ) • ■ ■ Yi) (see [3l pp. 32]). There are only finitely 
many choices for —^'ja', since is a hnite set, so m must be uniformly bounded in this 
case. 

Case 3: <5 7 ^ 0, 7 ' = 0. In this case, since M' has a non-zero determinant, we have that 
a' cannot be 0. Thus, fi/d = P/Q = (do; di, d 2 ,..., dm-i)- Again, there are only finitely 
many possible choices for the rational number /3/d, so m must be uniformly bounded in 
this case. 

Case f: d 7 ^ 0, 7' / 0 Let a'/^' = (dg; d'^,..., d'^,). By the definition of M.d, we must 
have dg > —1. If dm + dg > 1, then we have 


13 _ Pa' + P'i 
d Qa' -I- Q'Q 


(do, di,..., dfYi “t" dg, d ^, d 2 ,..., d^/). 


There are again only finitely many possible choices for /3/d and for each choice, the continued 
fraction expansion is completely hxed (up to the last digit). Likewise with a'/Q. Thus, 
for each of these finitely many choices, we must have that the sequence {do, di,..., dm} is 
fixed, and hence m must be uniformly bounded. 

If dm + dg = 0, then 

^ _ Pa' + P'i 
d Qa' + Q'^' 

The same argument applies here. □ 


(do; di,..., dm —1 T d]^, d 2 ,..., d,^/). 


By Lemma l2.ll we can construct functions R{[j],M), U{[j],M) that act on length-one 
strings of positive digits and matrices M ^ M.d such that R{[j], M) = [do; di, d 2 ,..., dm] 
and U{\j],M) = M' for a sequence d,, 0 < i < m, and M' satisfying ([5]). These functions 
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need not be unique. We can then extend these functions to act on arbitrary proper strings 
s in their first coordinate. We let 

(7) C7([ci, C 2 , ...,Cn],M) = [/([C 2 , ..., c™], t/([ci], M)) 
and 

(8) R([ci,C2, ... = R{[ci],M).r(j^C 2 ,C 3 , ... ,Cn],U{[ci],M)y 

The string R{s, M) is called the resultant string of s and M. With these definitions, if 
X = (ai, 02 , 03 ,...), M G then by applying ([5j) twice, we have that 

Mx = M (oi, 02 ,..., o„ + r”x) 

= MAa,Aa,...Aa^{T^x) 

= i?([oi], M).(l7([oi], M)A,, ... AaAT^x)) 

= 7?([oi],M).(i2([o2],[/([oi],M)).([/([o2],7/([oi],M))A,3...A,„(r-x))) 

= 7?( [oi, 02 ], M). ([/([oi, 02 ], M)A1,3 ... A, jT”x)) 

where the last equality follows by d?]) and ([8]). By applying ([5]) another re — 2 more times, 
we see that 

(9) Mx = i?([oi, 02 ,... ,an],M).(u{[ai,a 2 ,... ,a„],M) {T'^x)'^. 

As noted earlier, the functions R and U need not be unique, but we will assume through¬ 
out the rest of the paper that we are referring to the same function R and U every time. 

Lemma 2.3. There are only finitely many pairs ([j],M), j G N, M G whose resultant 
string is [—1; 1]. 

Moreover, suppose we have a pair ([ci,C 2 ,... ,Cfc],M), Ci £ N, M £ Md, such that for 
all i in the interval 1 < i < k, we have R{[ci], U{[ci, C 2 , ■ ■ ■, Cj_i, M)) = [—1; 1]. Then k is 
uniformly bounded by a constant dependent on D. 

Proof. Suppose, as in Lemma l2.ll that we have MJAj = A-iJAiM'. By rewriting this, 
we obtain Aj = JM~^A-iJAiM'. (Note that M~^ may not have integer coefficients, but 
the overall matrix must.) Since j is completely determined by M and M' and there are 
only finitely many possibilities for the two matrices as both are in Adu, the first part of 
the lemma is immediate. 

For the second part, it suffices to show that k cannot be a large even number. So suppose 
k is even. Then we have 

MJA,^JAc,... JAc, = (A_i JAi)^ M', 
but (A_i JAi)^ = I, so we have 

MJA,,JA,,... JA,, = M'. 

By standard facts about continued fractions, all the coordinates of JA^JAc^ ■ ■ ■ JA^f. should 
have size at least on the order of the /cth Fibonacci number (see [H Lemma 2.9] and note 
that all > 1), and since the bottom row of M consists of at least one positive value and 
one non-negative value, the coordinates in the bottom row of M' must have size at least 
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on the order of the kth Fibonacci number. But since the coordinates of M' are bounded, 
k must be bounded as well. □ 

Remark 2.4. At this point, we claim that to prove Theorem 11.11 it suffices to show that 
it holds for all CF-normal x G [0,1) and M ^ Md for some G N. First, note that by 
replacing M with MAa^ and x with x — oq, which is CF-normal if and only if x is, we may 
assume without loss of generality that x G [0,1). 

Now consider a matrix M = ^ ^ ^ ^ with | det(M)| = D ^ 0 and let x = (0; oi, 02 ,...) 

be a CF-normal number. Let Pn/q-n = (oo; aij ■ ■ ■ j ^n) be the convergents to x. Note that 
we have 

A 1A 1A 1A — ( 

ri-ao ^ai ’J r±a2 ■■■ a r±a„ ~ I „ „ 

\ yn—1 Hn 

Since {^1 + converges to the irrational number xy -|- 5, we have that for sufficiently 

large number n, the numbers Pn-iA + Qn-i^ and pn7 + QnS will have the same sign. Let 
M_i = MAagJAa^ ... JAa^, SO that the bottom row of M_i has the same sign and 

Mx = M_i {T^x) 

Without loss of generality, we may assume that the bottom row of M_i is strictly positive 
(if not we could multiply both sides by —I, which does not alter the way the matrix M 
acts on points x). We then let do G Z be the smallest integer so that A^^M-i has all 
non-negative coefficients. Now we can apply the Euclidean-type algorithm of the proof of 

Lemma O to A/q — '^do _1; Since this Afo satisfies the same properties as the Afo m 

the proof of Lemma l2.11 (We note that do here could be any integer and not just integers 
greater than or equal to —1; however, this does not impact the rest of the proof.) Therefore 
A'2^AI~^ can be written as 

JAd^JAd^ ... JAd^M' 

for some string [di, ^2; • • •, dm] and M' G A4d- Thus, 

Mx=[do-,di,d 2 ,...,dm].M'{T^x). 

But as we have that x G [0,1) is CF-normal if and only if T^x is CF-normal, and if and 
only if s.x is CF-normal for any string s, it therefore suffices to show that Theorem 11.11 
holds for CF-normal x G [0,1) and M G for some L> G N. 

3. Normality on a larger space 
Here we want to make our first step towards showing that the sequence 

(r’^x,t/([ai,a2,...,a„],Ar)) 

is nicely distributed in some sense. This will allow us to show that the last term of Q, 
which gives the tail of the CF expansion of Mx, is likewise nicely distributed. 

Let n C [0,1) denote the subset of irrational points and let Ad denote some finite set, 
which we will later take to be a set of matrices. We will let x denote elements in 11 and 
M denote elements of Ad. We will consider cylinder sets of 11 to be the intersection of the 
usual cylinder sets (for the continued fraction expansion) of [0,1) with H. 
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We wish to extend the Gauss map T to a transformation T on a larger domain hi = QxAi. 
For any (x,M) E fl, we define 

f{x,M) = {TxJa,{.){M)) 

for some function : Wf —)• Wf that is indexed by the first digit of x. Since the second 
coordinate of T{x,M) only depends on M and the first CF digit of x, we see that this 
second coordinate is constant for all x in the same rank 1 cylinder. Given a cylinder set 
Cs for hi, we call Cg x {M} (for any M E Ai) a cylinder set for Q. Moreover, we define 
]1{E X {M}) = fi{E)/\Ai\ for any measurable subset E of and b E Ai. 

For easier readability, we will use {E,M) to denote E x {M} for any measurable set 
E C 0,, with measurability being determined by Lebesgue measure or, equivalently, the 
Gauss measure. We will also let (E,A4} denote E x Ai. 

We adapt our definition of normality on this space. We will say that {x,M) E is 
T-normal with respect to a measure p on fl, if for any cylinder set {Cg, M') we have 


lim 

n—>-oo 


#{0 < i < n : T^{x,M) E {Cg,M')} 
n 


piCg,M'). 


We say T is transitive if for any Mi, M 2 E Ai, there exists a proper string s of length n 
such that 

T^iCg,Mi) = in,M2). 

The goal of this section is to prove the following result, which is similar to a previous 
result of the author |21] : however, as this paper contains significant departures (notably 
not assuming that the functions fa are bijective and hence not being able to assume that 
p, is T-invariant), we present the proof in full. 


Theorem 3.1. If T is transitive, then there exists a probability measure p on H that is 
absolutely continuous with respect to p and such that T preserves p and is ergodic with 
respect to p. Moreover, if x is CF-normal, then for any M E Ai, the point {x,M) is 
T-normal with respect to p. 

Since almost all numbers x E [0,1) are CF-normal, almost all (x,M) E H are T-normal 
with respect to p. Therefore, calling these points (x, M) “normal” is reasonable to do. 


3.1. Necessary Lemmata for Theorem 13.11 In order to simplify the readability of the 
proof of Theorem 13.11 we will include several lemmas here. All these results make use of 
the dehnitions and assumptions at the start of section [3l 

In order to show that {x,M) is T-normal with respect to p, we will need to make use of 
the Pyatetskh-Shapiro normality criterion in the following form. 


Lemma 3.2. Let {x, M) E If and suppose a measure p exists satisfying the first part of 
Theorem \3. 1[ If for any cylinder set {Cg,M'), we have 

, #{0 < i < n-1 : TTx,M) E (C„M')} 

Inn sup- 

n^oo ^ 

for some uniform constant a, then x is T-normal with respect to p. 


p{Cg,M'). 
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Proof. This is a simple consequence of Theorem 1 in m- We briefly describe how using 
the terminology from their paper. 

We let the family {Cm} denote the family of all cylinder sets on and also let (p{t) = a-t. 
Since the set Ai(T,xi,S) is fixed on rank-/ cylinder sets, we have that H^{Ai{T,xi,^)) 
equals a ■ fj.{Ai{T, xi-, S)) and thus goes to 0 as / ^ oo. □ 

A string s = [ci, C2 ,..., c^] is said to be traversing if for every Mi and M 2 there exists a 
i < n such that 

r {Cs,Mi) c (o,M 2). 

The interesting property of the traversing string is that for any x £ Cg, we have that the 
T-orbit of {x, M) eventually traverses all of Ml in its second coordinate, regardless of which 
M it started with. 

Lemma 3.3. If T is transitive, then a traversing string exists. 

Proof. Let Mi, M2 ,..., M|_yv/(| be the elements of Ai. By the transitivity of T, there exists 
a finite string si such that 

{M : there exists i < |si| such that T*(x,Mi) = {T^x,M) for all x G = Ml, 

that is, that the second coordinates of T^{x, Mi), i < |si| traverses all of Ml. Let M2 G Mi 
be given by tI®iI(x,M 2) = (*,M2) for any x G Again by transitivity, we can find a 
string S 2 such that 

{M : there exists i < |s2| such that T*(x,M2) = {T^x,M) for all x G Csj} = Ml. 

But then by construction, for any x G 6*5^52) the second coordinates of both T®(x;Mi) 
and T^{x; M 2 ), for i < |si| -|- |s2|, traverse all of Ml. In this way we can continue to 
find strings S3, S4,..., S|_yv(|, so that the concatenated string S1S2S3 ... S|_a4| is the desired 
traversing string. □ 

A well-known consequence of Renyi’s condition for continued fraction expansions (see 
Chapter 9 of [20]) states that there exists an absolute constant C > 0 so that for any 
measurable set E and cylinder Cg of rank n, we have that 

(10) ^fi{E)fi{Cg) < fi{T-^E n Cg) < Cfi{E)fi{Cs). 

It is clear that one could replace Cg by any set that can be expressed as a union of rank n 
cylinder sets. 

We will also want a similar equality (sans the cylinder set) to hold for T and jl, for which 
we will require the following key result. 

Lemma 3.4. Let {Kn}f}Li be a sequence of k x k Markov matrices such that Kn^ x 
uniformly for ni,n 2 G N. Assume that there exists a power i such that Kf has all positive 
coordinates, and also assume that > 0 for 1 < i < k. Then there exists an integer 

no G N and a constant c G (0,1) such that for any Ixk probability vector v and any n > uq, 
we have that all coordinates of VK 1 K 2 K 3 ... are in the interval (c, 1 — c). 
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Proof. This is a special case of Proposition 2.13 in [19j : the restriction that Km K^ 
uniformly for ni,n2 € N implies that Kn+iKn +2 ■ ■ ■ Kn+e Kf with an implicit constant 
dependent on i. This allows us to replace the “uniform irreducibility” assumption with our 
simpler condition that there exists a power i such that iPf has all positive coordinates. □ 

Lemma 3.5. Suppose T is transitive, then there exists a constant V > 0 and integer uq G N 
such that for any measurable set E C Q and any n > no, we have that 

(11) ^fi{E) < f,if-^E) < Vft{E). 

Proof. We will make a few definitions to start. Let Sn,i denote the set of cylinders {Cs,M) C 
n such that |s| = n and T'^{Cs,M) = {VL,Mi). We let Aij denote the set of a G 
N such that fa{Mi) = Mj. We also consider a sequence of probability vectors Vn = 
{vn,i,Vn,2, • • •, Vn,\M\}, n > 0 , such that 


^n,2 — 


/i U (C„M) =/i(r-(0,M,)) 

' {Cs,M)€Sr,,i 


For example, vq = {1/|A1|, 1/|A1|,... , 1/|A1|}. 

Our first goal will be to show that there exists a constant c' G (0,1) and integer tiq G N 
so that Vn,i G (c', 1 — c') if n > tiq. 

Consider a sequence of \Ki\ x \Ai\ matrices defined by 


{Kn)i,j ~ .y .^ ( LJ LJ i^s.[a]^^) 


fi Mi) n m. 


fi(T-^{n,Mi 


It is clear by construction that these matrices are stochastic and that Vn+i = VnKn. 

We let = K^j^iK^j ^2 ■ ■ ■ Kn. 

Unfortunately, the matrices Kn are not all the same, so Ko^n represents a Markov chain 
that is time-inhomogeneous. However, they are not far from being time-homogenous. By 
writing the matrix coefficients in a different way and applying (llOp . we see that 


1 




-'n,! 


= -M 

'^n.i 


u 

yiC,,M)€Sr,,i 


CsHT- 


U I • 


M 


—A U (C..M) U (J q.| 


dCs,M)&S„,i 


i Cl G 


!>' I U C[.| 1 . 

I ci^w4.i I 
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with the same implicit constant C as in (jlO|) . Thus x Kn 2 for any ni,n2 G N with 
uniform implicit constant This also implies that for any fixed L, we have that x 
KniL,{ni+i)L ^ n 2 L,{n 2 +i)L Uniformly for any ni,n2 G N with implicit constant 

Since T is assumed to be transitive, we know that for any Mi,Mk G M, there exists a 
cylinder {Cs,Mi) with |s| = I such that T^(Cs, Mi) = {Q,Mj). This implies that {Kf)ij x 
{KQ^l)i^j > 0. In other words, Ki is an irreducible matrix. Also, we should be able to find 
integers • • • A\m\ G N so that > 0. Since all terms of Ki are non-negative by 

construction, if > 0 then we have > 0 for any m G N. Thus, if we let 

L = lcm(£i,£2) • • • then we have that K[ is strictly positive along its diagonal. 

Suppose is itself irreducible. Then since has non-negative coefficients with a 
strictly positive diagonal, there is some power of it such that every coefficient is strictly 
positive (see equation (8.3.5) on page 672 of [E]). We may therefore apply Lemma [3~4l 
to the sequence of matrices {KnL,(n+i)L}^=o- So there exists c' G (0,1) and ng such that 
VnL,i G (c', 1 — c') for all i and all n > Uq. 

Suppose iLf is not irreducible. By Theorem 3.4.5 in [5], since Ki itself is irreducible, 
there exists a permutation matrix P such that 


( 12 ) 


PK^P'^ 


( Cl 0 

0 C2 


0 \ 
0 


\ 0 0 ••• Cr ) 


where the Cj are irreducible matrices. In this case, we would apply Lemma 13.41 to each Cj 
and thus show that there exist c'- G (0,1) and n'- such that VnL,i G (c'-, 1 — c)) for n > n) 
and for indexes i corresponding to the matrix Cj after undoing the permutation. By taking 
c' = min{c'} and ng = max{re'}, we get the same result as in the previous paragraph. 

No column of Ki consists of all O’s (otherwise there would be an M G A4 that is never 
visited, contrary to the transitivity of T), therefore the sum of the coefficients in any column 
vector of KnL,nL+j for j < L is uniformly bounded from below. Thus, we can therefore 
find a constant c and no such that Vn,i G (c, 1 — c) for all i and all n > uq. In particular, 
^ !• 

Now we can prove the desired statement m- It suffices to show the statement is true for 
n > ng and for sets of the form (E, Mi) for some measurable subset E G Q and Mi G A4. In 
this case, we have that T~^{E, Mi) equals the union of {{T~^E)r\Cs, M) for (Cg, M) G Snj 
as defined above. Therefore, by applying (fTn|) . we have 


f,(T-^{E,M,))= ^{{T-^E)nCg,M)= 

- fJ-iE) ^ IJ^{E)Vn,i - ^l{E) 

= \M\-fiiE,Mi)^fi{E,Mi), 


as desired. 


□ 
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3.2. Proof of Theorem l3.1L First, we will show that T is ergodic (despite not necessarily 
being invariant) with respect to fl. 

Suppose, we have a T-invariant subset of fl called E that has non-zero /i-measure. We 
dehne E‘^ = il. \ E. By projecting E onto the hrst coordinate, we see that the set 

{x G n I There exists M G M., with (x, M) G E} 

must have full //-measure, since it is invariant under T and T is ergodic with respect to //. 
Let Em denote the set of x G fl such that (x, M) G E. We claim that 

KEm, M) > 0 

for all M G A^. To show this, let M be fixed and let s be a traversing string, whose 
existence is guaranteed by Lemma [TS) Since E must project onto a full //-measure set in 
fl as described earlier, there must exist at least one M' G M with 

|l[{Cs,M')f^E) >0. 

By the definition of being a traversing string, however, there exists some i < |s| such that 

{Cs,M')cf-^ {n,M), 

which implies that 

0 < /i ((c„ M') nE) <ii (f-^ {n, M)nE'^ =ii (f-^ ((fi, m) n e)'^ . 

But since T is non-singular—that is, the preimage of a null set is itself a null set—so must 
T be. This can be seen by projecting onto the first coordinate again. Therefore, 

0 < // ((fl, M) D E) = jl {Em, M) , 

as desired. 

Now we wish to show that E has a substantial intersection with every cylinder set on 
fl, in particular, by showing that there exists a constant e > 0 so that for all cylinder sets 
{Cs,M), we have 

(13) il{En{Cs,M))>e-Jl{Cs,M). 

(There is no relation between the strings s considered from here on and the traversing string 
considered earlier.) 

Since there are only hnitely many elements in Ai, there must exist e' > 0, such that 
i1{Em') > for all M' G M. Let us now fix an arbitrary cylinder {Cs,M) with n := |s|, 
and let M' be such that T^{Cs,M) = (fl,M'). By applying (fTOll . we have 

~^{E n {Cs, M)) = /2 (s n (f-”(fi, M')) n (C„ m)) 

= /2 {{Cs,M) n f-^ {E n (fl,M'))) 

= /i((C„M)nf-(i?M',M')) 

= p^// [t-^Em' n Cs) > -^^^^{EM')^^{Cs) 
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Therefore, letting e = e'jC gives (fT3]l . 

Since the cylinder sets generate the Borel sets on n, we can find, for any 5 > 0, a set 
Es such that f},{E‘^AEs) < 6 and Es is a disjoint union of a finite number of cylinder sets. 
Therefore, by applying (fT^ . we have 

ll{EnE^) = il{En Es) + 0{6) > efi{E)fi{Es) + 0{6) 

= eil{E)il{E‘^) + 0{6). 

But jl{E n E‘^) = 0 and 5 was an arbitrary positive number. Thus either jl{E) = 0 or 
fi{E^) = 0. Since we know E has positive measure, this therefore implies that E must have 
full measure, and T is ergodic with respect to jl. 

We will now construct a measure p that is absolutely continuous with respect to jl such 
that T is not only ergodic but also invariant with respect to p. 

We define a sequence of measures pn on by 

(14) = - E = /- k E xAiTH) dm- 

k=0 \ k=0 / 

By Lemma 13.51 we can show that 


lim sup — 


n—1 

J2f^{T-’^E)<VjliE), 


k=0 


for any measurable set E. Therefore, by a theorem of Ryll-Nardzewski (see page 683 of 
i), the integrand of (jl4|] converges pointwise to a Li function gA_ almost everywhere, and 
since the integrand is dominated by 1, the integrand must in fact converge uniformly to 
PA almost everywhere. Therefore, we may define p{A) = lim„^ooThe Vitali-Hahn- 
Saks theorem [3] shows that p is in fact a probability measure on Q.. Since pn{T~^E) = 
Pn{E) + 0(l/n) for any measurable set E, we have that p is preserved by T. Likewise, by 
Lemma 13.51 again, we can see that 

(15) ^<pn{E)<Vp{E). 

and thus the same is true if we replace pn by 

Thus it remains to show that if x E n is CF-normal, then (x,M) is T-normal with respect 
to p for any M ^ M.. 

So consider a point x E that is CF-normal. Then for every cylinder Cg and every 
M E A4, we have 


hm 4#{1 <n<N\ T^{x,M) E {Cs,M)} 
N—>-oo N 


hm ^#{1 <n<N\T^xeCs} 

N—^OO I\ 

KCs)- 


Thus, in particular, we have for any M, M' E A4 

lim sup -^#{1 <n<N\ T”(x, M) E {Cs,M')} 
N^oo IV 


*This part of the proof draws heavily on techniques used in the proof of Theorem 5.3.5 in | 12| . 
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< limsup -^#{1 < n < N \ T^{x, M) G {CsiM)} 

= f^iCs) = M') < ^^p{Cs, M'). 

Thus by Lemma 13.21 the points (x,M) for all M £ A4 are T-normal with respect to p. □ 

Remark 3.6. We may consider Lebesgue measure on Q to be the Lebesgue measure on ri 
crossed with the counting measure on Ai. Since the Lebesgue measure and Gauss measure 
on Q are absolutely continuous with respect to one another, and since we have (1151) . we see 
that Lebesgue measure (on O) and p are absolutely continuous with respect to one another 
as well. 


4. Building a dynamical system 

As we hinted at the start of Section [3l we would like to build a dynamical system T 
from Q X M.d to itself by T{x,M) = {Tx,U{[ai], M)). We will use this definition for T 
throughout the rest of the paper; however, it may turn out that this system is not transitive 
and thus Theorem l3.1l mav not apply. Thus we will require the following definition to modify 
our dynamical system. 

We will call a subset C a transitive component if the following conditions are 
satisfied: 

(1) For any proper string s, and any M G Ad^ we have that U{s, M) G Ad'^. 

(2) For any M, M' G Ad^ there exists a proper string s such that U{s, M) = M'. 

Note that any two distinct transitive components of Ad^ must have empty intersection. 

With this definition the transformation T given by T(x,M) = (Tx, U{[a], M)) is transi¬ 
tive on n X M'n for any transitive component Ad^ <Z Mq. 

Lemma 4.1. There exists at least one transitive component of M.d- Moreover, there exists 
a string s such that U{s,M) is in a transitive component for any M G Ad/j (although not 
necessarily always in the same transitive component). 

Proof. Consider a directed graph G whose vertices are matrices M £ Ad d and which has 
an edge from Mi to M 2 if there exists a j G N such that U{[j], Mi) = M2. Note this graph 
has out-degree always at least 1. A subgraph G' of G is said to be strongly connected if for 
any Mi, M 2 in V{G'), the vertex set of G', there exists a path from Mi to M 2 and vice- 
versa. We can partition G into its strongly-connected components, which are the maximal 
strongly-connected subgraphs of G. (Note that if there is a vertex M such that there is no 
path from M to another vertex and back to itself, then M is its own strongly-connected 
component.) Let us call these components Gi, G 2 , ■ ■ ■, Gn- Note that if M G V{Gi), then 
V{Gi) consists of all vertices which are strongly-connected to M. 

Now let us consider another directed graph G whose vertices are Gi,G 2 ,... ,Gn and 
where there is an edge from Gi to Gj with i j \i there exists an edge from some Mj G Gi 
to some Alj £ Gj in the directed graph G. We do not let Q contain an edge which goes 
from a vertex to itself. Note that if there is an edge from Gi to Gj, then by the strong- 
connectivity of these components, there is a path from any Mj G Gi to any Mj £ Gj. We 
see that G cannot have any cycles, as otherwise it would be possible to find matrices in 
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two different strongly-connected components that are strongly connected to one another, 
contradicting the maximality of these components. Thus Q is acyclic. 

Any finite acyclic directed graph must contain at least one sink. We claim that the set 
of vertices of any sink Gi of ^ is a transitive component for Aio- Let us fix a sink Gi and 
let Ai' denote the matrices in V{Gi). The first condition for being a transitive component 
is satisfied because if there existed a proper string s and M E Ai' such that C/(s, M) 0 Ai', 
then there would be at least one path from Gi to another strongly-connected component, 
contradicting the assumption that Gi is a sink. For the second condition, this follows from 
the fact that within any Gi there is a path from any vertex to any other vertex and also to 
itself. 

Now consider all the matrices that do not lie in a transitive component, let us call them 
Ml, M 2 , ..., Mfc. Consider Mi and suppose it is in Gi. As there must be a path from Gi to 
a sink of Q, there exists a string si such that U{si, Mi) is in a transitive component. Now 
consider U{si, M 2 ). Regardless of what matrix in Ain the matrix U{si,M 2 ) happens to 
be, there is, by the same argument, a string S 2 such that U{s 2 ,U{si, M 2 )) = U{siS 2 , M 2 ) 
is in a transitive component. (The string S 2 could be empty if U{si,M 2 ) is already in 
a transitive component.) Likewise there is a string S3 such that C/(siS2S3, M3) is in a 
transitive component, and so on. The desired string s is simply S1S2S3 ... s^. □ 

The second part of Lemma l4 .1 1 allows us to reduce the cases of Theorem ll.il vet further. 
By Bemark l2.4[ it suffices to prove Theorem II.II in the case where x E [0,1) is a CF-normal 
number and M E for some D. Let s be a string satisfying Lemma l4.ll If x is CF- 
normal, its continued fraction expansion contains s. Thus there exists some n such that 
T^{x,M) E n X A4'jj, for some transitive component Ai'j^ C A4 d, and so 

Mx = R{[ai,a 2 ,.. . ,a„],M). (17([ai,a2,.. . ,an], M){T'^x)), 

with U{[ai,a 2 , ..., o^], M) E Ai'^. As noted in Remark 12.41 the action of T and the action 
of strings both preserve CF-normality and CF-nonnormality, so it suffices to assume that 
X E [0,1) is CF-normal and that M is in some transitive component A4^. 

Remark 4.2. The dynamical system discussed above bears a non-trivial resemblance to the 
dynamical system on a skew-product studied by Fisher and Schmidt m, although there 
appears to be no direct overlap. 

Lemma 4.3. Let T and p be the transformation and measure corresponding to some tran¬ 
sitive component Ai'j^, the latter of whose existence is gauranteed by Theorem \3 . R 

Let {Csf,M£) be a (possibly countable) seguence of cylinder sets on Li = Ll x Ai'j^, such 
that 

(1) for any x and M, we have {x,M) E {Gsi,M^) for at most finitely many i; and, 

(2) for any x and M, there exists a cylinder {Gs,M) with x E {Cs,M) and such that 
no proper subcylinder {Cs',M) C {Gs,M) is in the sequence {Cs^,M^). 

Let ai > 0 be an associated sequence of positive real numbers so that the function 

I 

is bounded. Here 1_b(-) is the standard indicator function of the set E. 
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Let X E [0,1) be CF-normal and M G Md. Then 

n—l 

f{f\x,M)) = n- 

j=0 



/ dp] (l + o(l)). 


Proof. Prom the second part of Theorem 13.11 we have that for any cylinder set {Cs,M) 
that 

n—l 

= n- 

i=0 



l(Cs,M) dp] {1 + o(l)). 


Thus, if the sequence is finite, then the result follows immediately, so we suppose 

it is countably infinite instead. 

We will refer to D as an anti-cylinder if it equals the difference of a cylinder set with a 
finite number of other cylinder sets. For any anti-cylinder D, we can write the indicator 
function 1/) as a finite sum and difference of indicator functions of cylinder sets, thus we 
have that 


71—1 

^lzi(r(x,M))=n- 

i=0 



(1 + 0 ( 1 )). 


Now let us consider two sequences of functions and fj . We let 


i<j 


so that fj < f and fj < /j+i- To define we first define the set Sj, which will be 
a parition of H. We let Sq = {11}. We then define all the Sj recursively, deriving Sj by 
letting it be the coarsest refinement of Sj-i that includes all cylinder sets Cg with the sum 
of digits in s equal to j. Thus 


51 = {ii\qi],qi]}, 

52 = {n \ (qi] u C[ 2 ]), q 2 ]> qi] \ qi,i]> qi,i]}> 

and so on. Note that Sj always consists of just cylinders and anti-cylinders. For any S G Sj, 
we let a(^s,M) = define 

//= X] «(S,M) • 1(5,M)(-)- 
SeSj,MeM'jy 

Note that /+ > / and /+ > /q^. 

We in fact have that f~ and converge pointwise to / (and hence in norm by dominated 
convergence). For f~ this follows from assumption (1) that any (x,M) E 11 is in finitely 
many of the cylinders {Cg^,M(), and hence for sufficiently large j, f~{x,M) = f{x,M). 
For /{*“, this follows from assumption (2), as for any [x,M) G 11 there exists a cylinder 
set {Cs,M) containing {x,M) such that / is constant on {Cs,M), thus for large j we have 
f^ix,M) = f{x,M). 
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Thus we have, for any j > 0 

n—1 n—1 n—1 

^ M)) < Y, f{f\x, M)) < Y M)) 

2 = 0 2=0 2 = 0 

n • (/- df^ (1 + 0(1)) < I; f{f\x, M))<n- ( /+ dp^ (1 + 0(1)). 

As both integrals above converge in norm to f dp as j tends to infinity, this completes 
the proof. □ 


5. Trigger strings 

Consider three strings s+,s,s_ and three matrices M_|_,M, M_ in some transitive com¬ 
ponent C M-d such that U{s+, M_|_) = M and U{s, M) = M_. With these definitions, 
note that 

fl«+l(C,+„M+) = (C„M) and f l^l(C,,_ , M) = , M_). 

In this way, we can think of going from (s, M) to (s+s,M_|_) as prepending digits to the 
start of {s,M), and likewise we can think of going from {s,M) to (ss_,M) as appending 
digits to the end of {s,M). 

If we denote the resultant strings of (s+, M+), (s, M), (s_, M_) by respectively 

with t = [do;di,d2,---,dm], then it is easy to see that the resultant string of (s+s,M_|_) 
should equal t+t and in fact, all the digits of t appear unchanged and in the same position 
(relative to the end of the string) except perhaps do and di. We say that these digits 
appear in the same relative position. Likewise R{ss-,M) = tt- and all the digits of t 
appear unchanged and in the same position (now relative to the front of the string) except 
perhaps dm-i and dm- Again we say these digits appear in the same relative position. 

We say a substring r appears nicely within the resultant string t if it starts after the 
first digit and ends before the second-to-last digit. Thus, no matter what is prepended or 
appended to s, r will still appear in the corresponding resultant string t+tt-. Although r 
could appear many times in this longer string, there is exactly one copy of r that occupies 
the relative position corresponding to the original copy of r in t. 

We can run this procedure in reverse as well. If we start with a very long string s and 
matrix M G Ad, we could decompose s as with corresponding matrices M_|_, M, M_. 

(In this case, we would refer to {s,M) as a substring of {s,M).) We could obtain other 
decompositions by removing the first digit of s and appending it to the end of s+, and this 
would alter the head of the resultant string t (as well as the matrix M). Similarly we could 
remove the last digit of s and prepend it to s_, altering the tail of t. 

Let t = R{s,M). Suppose we have a proper string r that appears nicely in t. While 
there are potentially many ways of decomposing s = so that r appears nicely in t 

corresponding to the relative position of the original copy of r in t, there is a unique way 
of doing this so that the length of s is minimized. Namely, we remove digits from the head 
or tail of s and append them to or prepend tern to s- until removing any more digits 
would cause r to no longer appear nicely within t at the same relative position. We refer 
to this (s, M) with the length of s minimized as a trigger string for r. If there are exactly 
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k copies of r in i all of which have the same minimal decomposition, we say that (s, M) 
is a trigger string of multiplicity k. For example if the resultant string of ([10], M) was 
[1; 1,1,1,1,1] and the desired string r = [1], then this string has multiplicity 2. 

From these definitions, we see that the total number of times r appears in t is equal (up 
to 0(1)) to the number of trigger strings {s,M) counted with multiplicity that occur in 
(s, M). The 0(1) counts those four possible positions where r could appear in t but not 
nicely. 

6 . Proof of Theorem 11.11 

As we have noted in Remark 12.41 and Section [U it suffices to prove Theorem 11.11 in the 
case where x E [0,1) is CF-normal and M E where is some transitive component 
of A4d- ^ 

Note that Theorem 13.11 applies to T acting on the set 11 x giving us an ergodic, 

invariant measure p on this space, and {x, M) is normal with respect to T and p. 

For the first and largest step of the proof, we want to show that for any finite proper 
string r, this string appears in Mx with a limiting frequency that does not depend on x. 
(However, we will resume throughout the proof that M is a fixed matrix.) In particular, 
we want a constant pr such that 

, . #{0 < i < m : r*(Mx (mod 1)) E Cr} 

(16) hm -= Pr, 

m^oo Tfi 

for all CF-normal x E [0,1). 

Let X = (ai, 02 , 03 ,...), and let Mx = (6o; • ■ •) so that 

Mx = R([oi, 02 , ... ,On],M). (t/([oi, 02 ,..., o^j,M)(r”x)). 

We let i{n) denote the length of i?([oi,a 2 ,... ,On],M). The following two lemmas assume 
that r and Ad^ are fixed and that x is any CF-normal number in [0,1) and M E Ad)^. We 
will also let n = n X Ad^. 

Lemma 6.1. We have for some constant ci > 0 

£{n) = cire(l -F o(l)). 

Proof. We note that if the resultant strings of ([o*], U{[ai, 02 ,..., Oj_i], M)), for i < n, all 
have a zeroth digit that is non-negative, then we clearly have that £(n) is the sum of the 
lengths of these resultant strings. We may call this the “expected” length. However, 
if R([oi],17([oi,a2,...,Oi_i],M)) = [*, 1] and R([ai+i], t/([oi, 02 ,..., Oj], M)) = [-1;*], 
then some of the digits cancel, removing two digits from the expected string length. 
This exception has its own exception: we must be careful if the resultant strings of 
([oj], 17([ai, 02 ,..., Oj-i], M)) and ([oj+i], t/([oi, 02 ,..., o*], M)) are both [—1;1], as these 
will essentially annihilate one another. 

With this in mind, let us consider a set S consisting of pairs (s, M) where s is a string 
and M is in our transitive component Ad^, along with a function g that acts on these pairs 
such that the following holds true. 

(1) All pairs ([ci], M) whose resultant string is [—1; 1] are in S; and ^([ci], M) = 0 here. 

(2) All pairs ([ci],M) where the last digit of the resultant string is greater than 1 are 
in S; and 5 :([ci], M) equals the length of the resultant string. 
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(3) All pairs ([ci, C 2 ,..., cj], M) where the last digit of -R([ci], M) is 1, 
i?([ci],[/([ci,C 2 ,... ,Ci_i],M)) = [-1;1], 2 < i < j - 1, 

and [/([ci,... ,Cj_i],M)) / [—1; 1] are all in S; and g of this pair is the 

length of i?([ci, C 2 ,..., Cj], M) minus the length of -R([cj], [/([ci,..., Cj_i], M)). 

By Lemma 12.31 we see that if (s, M) G S, then s has bounded length. Moreover, the 
cylinder sets {Cs,M) for {s,M) G S form a partition of 12. 

Now let us have a function G on 12 defined so that G{x, M) = g[s^ M) if x G for some 
s with (s, M) G S. By Lemmas 12.21 and 12.31 we see that G is uniformly bounded. Note that 

G= ^ g{s,M) ■ 

(s,M)g5 

and that G{x,M) > 0 for some {x,M). 

From the above definitions we have that i{n) = G(r(x, M)) + 0(1). The 0(1) 

term accounts for small oddities that may occur near the beginning or end of the resultant 
string R{[ai,a 2 , ■ ■ ■ ,an], M). The conditions of Lemma [4.31 hold (trivially so, as none of 
the cylinders in S overlap), and so £{n) = n • (JqG dp) (1 + o(l)). Since p is absolutely 
continuous with respect to the Lebesgue measure on 12 and vice-versa by Remark 13.61 this 
completes the proof. □ 

Lemma 6.2. We have for some constant Cr > 0, 

#{0 < i < i{n) : T^{Mx (mod 1)) G Cr} = Cr-n(l -|- o(l)). 

Proof. By Section[5l we see that #{0 < i < i{n) : T^{Mx (mod 1)) G Cr} equals the number 
of trigger strings for r (counted with multiplicity) that are substrings of [oi, 02 ,..., a,i], up 
to 0(1) to account for those r that may not appear nicely within the resultant string. If 
the trigger strings had uniformly bounded length, this would be equal to the number of 
times r*(x,M) lands in a cylinder set {Cs,M') corresponding to a trigger string {s,M') 
counted with multipllicity. Again this would be up to 0(1) to account for T*(x, M) landing 
in cylinder sets that make use of digits of x beyond the reth digit. We do not know if the 
trigger strings have bounded length (although we strongly suspect this is the case), so we 
must be a bit more careful in our proof, using a sequence of functions Ff~ and which 
will undercount or overcount the number of appearances of the trigger strings. 

As there are countably many strings of finite length, there are countably many trigger 
strings for r, so let us order them as {si, Mi) in such a way so that if (Cs.,, Mj/) C (C^j, Mi) 
then i' >i. 

Define a function F~ on 12 such that Ff~{x,M) counts the number of trigger strings 
(sj. Mi) (with multiplicity) with i < j such that x G C^. and M = Mi. If we let fc, denote 
the multiplicity of {Cs^,Mi), then we have 

i<j 

From our discussion in the first paragraph of the proof, we see that 

(t\x,M)'^ + Oj{l) <#{0<i< £{n) : T\Mx) G Cr}. 

i<n 
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The function F- clearly satisfies the conditions for Lemma 14.31 as it is a finite sum, so 

(17) n ■ F- dp^ (1 + o(l)) < #{0 < i < £{n) : T\Mx) G CJ 

for fixed j. 

If (s, M) is a trigger string for r and the first digit of s is c, then the first digit for this 
appearance of r must be within the first k + 2 digits of R{s,M), where k is the length 
of i?([c],M); otherwise we could remove c from the start of s, and r would still appear 
nicely within the shortened resultant string in the same relative position, contradicting the 
minimality condition for trigger strings. It is possible that if (sj,Mj) is a trigger string for 
r, then there could be a string s_ so that is also a trigger string for r, but these 

two appearances of r cannot start at the same place (relative to the start of the resultant 
string). Therefore, if we let 

K = 2+ max |i?([c],M')| , 


and by Lemma 12.21 the maximum here really does exist, then any given point in can 
lie in at most K cylinders corresponding to trigger strings (counted, as 

usual, with multiplicity). 

We then define another function F^ on 17 by 




K, if (x, M) G {Cs^, Mi) for some i > j, 

F~{x,M), otherwise. 


Now we have that < i < £{n) : T^{Mx) G Cr} is less than the number of times 
T^{x,M) is in a cylinder set corresponding to a trigger string (counted with multiplicity) 
and this in turn is less than 

i<n 

We can write F^ another way, as 

i>j 


where the starred sum runs over all i > j such that (C^., Mj) is not a subset of any other 
(Cs.,,Mi/) with i' > j, and Ki is equal to K minus the value of F~ on this cylinder set. 
These Ki are well-defined because of the ordering we chose for the trigger strings. Since 
the starred sum is over a disjoint union of cylinder sets, the function F^ also satisfies the 
conditions for Lemma 14.31 Therefore 

(18) #{0 < i < £{n) : T\Mx) e Cr} < n ■ (^j_^ F+ dp'^ (1 + o(l)) 

for fixed j. 

Since any point in 17 appears in a uniformly bounded number of cylinder sets, the func¬ 
tions F~ and F)^ converge pointwise as j tends to infinity. And since both functions are 
uniformly bounded by the constant function K, they must converge in norm as well by 
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the dominated convergence theorem. Combining this with (I17p and (|18p gives the desired 
result. □ 

From here we are nearly done. First, let := max{n : i{n) < m}. By Lemma l6.11 

= m/c(l + o(l)), so therefore we have that 

0 < m — < o{m) 

Thus we have that 

#{0 < i < m : T\Nx) £ q} _ #{0 < i < £{r^{m)) : T\Nx) £ q} + o(m) 

m->-oo m ra^oo + o{m) 

and so (jl6h follows from Lemma 16.11 and 16.21 
Now consider the sets 

Em = {Mx : X G [0,1) is CF-normal} and Fi = {x : x G M is CF-normal}. 

We have shown that for any string r there exists a constant pr such that for all y G Em, 
the string r appears in the continued fraction expansion of y with limiting frequency pr, 
even though we do not know what any of these constants pr equal. On the other hand, for 
all strings r and all x G Fi, the limiting frequency of r in the continued fraction expansion 
of X is p{Cr)- Thus, either p^ = p{Cr) for all r and Em is a subset of E oi pr ^ p{Cr) for 
some r and Em is disjoint from E. 

However, E has full Lebesgue measure and Em, being a non-trivial linear fractional 
transformation of a positive measure set, has positive measure, so Em must be a subset of 
E, and the theorem is proved. 


7. Further questions 

In one—admittedly peculiar—sense, the generalization that we have proved of Wall’s 
result is not the natural generalization to make. What makes rational numbers so nice for 
any base b, is that they are eventually periodic. So one could ask the following. 

Suppose X is CF-normal and q and r have eventually periodic continued fraction expan¬ 
sions—that is, they are both quadratic irrationals—with q ^ 0- Must it be true that qx + r 
is CF-normal as well? 

Also, while Theorem o solves Bugeaud’s problem, it does not solve Mendes France’s 
problem. CF-normality is a much stronger condition than CF-simple normality, and our 
proof relies crucially on full CF-normality. So we ask, as Mendes France did: does non-zero 
rational multiplication and rational addition preserve CF-simple normality? 
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